Lessons Learned - The Case of CROCUS: Cluster-Based Ontology Data Cleansing
نویسندگان
چکیده
Over the past years, a vast number of datasets have been published based on Semantic Web standards, which provides an opportunity for creating novel industrial applications. However, industrial requirements on data quality are high while the time to market as well as the required costs for data preparation have to be kept low. Unfortunately, many Linked Data sources are error-prone which prevents their direct use in productive systems. Hence, (semi-)automatic quality assurance processes are needed as manual ontology repair procedures by domain experts are expensive and time consuming. In this article, we present CROCUS – a pipeline for cluster-based ontology data cleansing. Our system provides a semi-automatic approach for instance-level error detection in ontologies which is agnostic of the underlying Linked Data knowledge base and works at very low costs. CROCUS has been evaluated on two datasets. The experiments show that we are able to detect errors with high recall. Furthermore, we provide an exhaustive related work as well as a number of lessons learned.
منابع مشابه
CROCUS: Cluster-based Ontology Data Cleansing
Over the past years, a vast number of datasets have been published based on Semantic Web standards, which provides an opportunity for creating novel industrial applications. However, industrial requirements on data quality are high while the time to market as well as the required costs for data preparation have to be kept low. Unfortunately, many Linked Data sources are error-prone which preven...
متن کاملJoint proceedings of Second International Workshop on Semantic Web Enterprise Adoption and Best Practice ( WaSABi 2014 ) & Second International Workshop on Finance and Economics
Over the past years, a vast number of datasets have been published based on Semantic Web standards, which provides an opportunity for creating novel industrial applications. However, industrial requirements on data quality are high while the time to market as well as the required costs for data preparation have to be kept low. Unfortunately, many Linked Data sources are error-prone which preven...
متن کاملRegionalization of the Iowa State University Extension System: Lessons Learned by Key Administrators
The cyclical economic downturn in the United States has forced many Extension administrators to rethink and adjust services and programming. The Cooperative Extension System (CES), the organization primarily responsible for governmental Extension work in the United States, at Iowa State University responded to this economic downturn by restructuring its organization from county based to a regio...
متن کاملPsychosocial Rehabilitation: Some Lessons Learned From Natural Disaster in Iran
Background: Disasters have adverse impacts on different aspects of human life. Psychosocial Rehabilitation is one of the fields which is usually overshadowed and ignored by physical rehabilitation or its importance does not receive proper attention. This research attempts to study some lessons learned from Psychosocial Rehabilitation based on disaster experiences in Iran. M...
متن کاملRebirth of a city lessons learned from post disaster reconstruction the case study: Rofayye\'
After disasters, one of the main challenges confronting authorities is site selection for reconstructing damaged structures. Experiences indicate that appropriate policies in site-selection could greatly influence on reconstruction success and residents' satisfaction. Meanwhile, in literature related to post disaster reconstruction, avoiding from relocating settlements is generally emphasize...
متن کامل